Overview
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 5037 |
| Missing cells | 1772 |
| Missing cells (%) | 2.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 629.6 KiB |
| Average record size in memory | 128.0 B |
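The two derived figures in the table above can be recomputed from the raw counts. This is a minimal sketch, assuming the percentage is taken over all cells (variables × observations) and that 1 KiB is 1024 bytes. The function names are illustrative and are not part of ydata-profiling.

```python
def missing_cell_pct(missing_cells: int, n_vars: int, n_obs: int) -> float:
    """Share of missing cells among all cells, in percent."""
    return 100.0 * missing_cells / (n_vars * n_obs)


def avg_record_bytes(total_kib: float, n_obs: int) -> float:
    """Average in-memory size of one row, in bytes (1 KiB = 1024 B)."""
    return total_kib * 1024 / n_obs


# Figures copied from the dataset statistics table.
pct = missing_cell_pct(1772, 15, 5037)
rec = avg_record_bytes(629.6, 5037)
print(f"{pct:.1f}% missing, {rec:.1f} B per record")
# prints "2.3% missing, 128.0 B per record"
```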
Variable types
| Numeric | 10 |
|---|---|
| Text | 3 |
| Categorical | 1 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| id is highly overall correlated with number_of_reviews | High correlation |
| latitude is highly overall correlated with longitude | High correlation |
| longitude is highly overall correlated with latitude | High correlation |
| number_of_reviews is highly overall correlated with id and 1 other field | High correlation |
| reviews_per_month is highly overall correlated with number_of_reviews | High correlation |
| last_review has 886 (17.6%) missing values | Missing |
| reviews_per_month has 886 (17.6%) missing values | Missing |
| price is highly skewed (γ1 = 21.51400656) | Skewed |
| id has unique values | Unique |
| number_of_reviews has 886 (17.6%) zeros | Zeros |
| availability_365 has 948 (18.8%) zeros | Zeros |
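The percentages quoted in the missing-value and zero alerts can be checked against the observation count. The sketch below assumes each percentage is the count divided by the 5037 observations and rounded to one decimal. The helper `pct` is a hypothetical name introduced here for illustration.

```python
N_OBS = 5037  # number of observations reported in the dataset statistics


def pct(count: int, total: int = N_OBS) -> float:
    """Percentage of rows, rounded to one decimal as in the report."""
    return round(100.0 * count / total, 1)


# Counts copied from the alerts.
missing_last_review = 886
missing_reviews_per_month = 886
zero_number_of_reviews = 886
zero_availability_365 = 948

print(pct(missing_last_review))    # 17.6
print(pct(zero_availability_365))  # 18.8

# The three 886 counts coincide, which would be expected if the listings
# with zero reviews are the ones lacking a review date and a monthly rate.
same_count = (
    missing_last_review == missing_reviews_per_month == zero_number_of_reviews
)
print(same_count)  # True
```

Matching counts alone do not prove that the same rows are involved. Confirming that would require comparing the rows in the underlying data.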
Reproduction
| Analysis started | 2025-03-01 00:39:42.877013 |
|---|---|
| Analysis finished | 2025-03-01 00:39:49.460535 |
| Duration | 6.58 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
Variables
id
Real number (ℝ)
High correlation  Unique 
| Distinct | 5037 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28206886 |
| Minimum | 4505 |
|---|---|
| Maximum | 45515581 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 4505 |
|---|---|
| 5-th percentile | 4173200.2 |
| Q1 | 19087342 |
| median | 30010581 |
| Q3 | 39519693 |
| 95-th percentile | 44355959 |
| Maximum | 45515581 |
| Range | 45511076 |
| Interquartile range (IQR) | 20432351 |
Descriptive statistics
| Standard deviation | 12779971 |
|---|---|
| Coefficient of variation (CV) | 0.45307983 |
| Kurtosis | -0.87737091 |
| Mean | 28206886 |
| Median Absolute Deviation (MAD) | 9961534 |
| Skewness | -0.49974578 |
| Sum | 1.4207808 × 10¹¹ |
| Variance | 1.6332766 × 10¹⁴ |
| Monotonicity | Not monotonic |
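Several of the statistics above are simple functions of the others. The following sketch recomputes range, IQR, coefficient of variation, and variance for `id` from the reported values, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared). `derived_stats` is an illustrative helper, not a ydata-profiling function.

```python
def derived_stats(minimum, q1, q3, maximum, mean, std):
    """Recompute the dependent summary statistics from the basic ones."""
    return {
        "range": maximum - minimum,
        "iqr": q3 - q1,
        "cv": std / mean,
        "variance": std ** 2,
    }


# Inputs copied from the id tables (already rounded by the report).
stats = derived_stats(
    minimum=4505,
    q1=19087342,
    q3=39519693,
    maximum=45515581,
    mean=28206886,
    std=12779971,
)
print(stats["range"])  # 45511076
print(stats["iqr"])    # 20432351
print(f"{stats['cv']:.4f}")
print(f"{stats['variance']:.4e}")
```

Because the inputs are rounded, the recomputed CV and variance agree with the report only to the displayed precision.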
| Value | Count | Frequency (%) |
| 42951790 | 1 | < 0.1% |
| 21174388 | 1 | < 0.1% |
| 42670833 | 1 | < 0.1% |
| 7564825 | 1 | < 0.1% |
| 9660334 | 1 | < 0.1% |
| 18059552 | 1 | < 0.1% |
| 33810838 | 1 | < 0.1% |
| 9505429 | 1 | < 0.1% |
| 35926860 | 1 | < 0.1% |
| 24628815 | 1 | < 0.1% |
| Other values (5027) | 5027 |
| Value | Count | Frequency (%) |
| 4505 | 1 | |
| 7126 | 1 | |
| 9811 | 1 | |
| 10945 | 1 | |
| 12068 | 1 | |
| 12140 | 1 | |
| 22362 | 1 | |
| 24833 | 1 | |
| 25879 | 1 | |
| 28749 | 1 |
| Value | Count | Frequency (%) |
| 45515581 | 1 | |
| 45515281 | 1 | |
| 45514632 | 1 | |
| 45514091 | 1 | |
| 45513842 | 1 | |
| 45512087 | 1 | |
| 45507485 | 1 | |
| 45504480 | 1 | |
| 45495441 | 1 | |
| 45494553 | 1 |
name
Text
| Distinct | 4913 |
|---|---|
| Distinct (%) | 97.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.7 KiB |
Length
| Max length | 206 |
|---|---|
| Median length | 100 |
| Mean length | 41.284495 |
| Min length | 2 |
Unique
| Unique | 4847 |
|---|---|
| Unique (%) | 96.2% |
Sample
| 1st row | Best deal in the area |
|---|---|
| 2nd row | whole apartment kitchen **1 bed/bath **living room |
| 3rd row | Logan Square 3 bedrms with parking |
| 4th row | Vintage Charm Near Lake |
| 5th row | Study Oasis |
| Value | Count | Frequency (%) |
|  | 1699 | 4.9% |
| in | 1172 | 3.4% |
| chicago | 575 | 1.7% |
| private | 561 | 1.6% |
| park | 540 | 1.6% |
| room | 539 | 1.6% |
| to | 528 | 1.5% |
| the | 502 | 1.4% |
| apartment | 482 | 1.4% |
| bedroom | 474 | 1.4% |
| Other values (3572) | 27606 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 29700 | 14.3% |
| e | 14689 | 7.1% |
| o | 14227 | 6.8% |
| a | 10980 | 5.3% |
| i | 10592 | 5.1% |
| t | 10391 | 5.0% |
| n | 10103 | 4.9% |
| r | 9986 | 4.8% |
| l | 5600 | 2.7% |
| s | 4815 | 2.3% |
| Other values (304) | 86867 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 207950 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 207950 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 207950 |
Most frequent character per category, script, and block
Every character falls in the single (unknown) group, so each of these three tables is identical to the Most occurring characters table.
host_id
Real number (ℝ)
| Distinct | 2924 |
|---|---|
| Distinct (%) | 58.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 99335943 |
| Minimum | 2140 |
|---|---|
| Maximum | 3.6790706 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 2140 |
|---|---|
| 5-th percentile | 1597503 |
| Q1 | 17352266 |
| median | 58773353 |
| Q3 | 1.6348161 × 10⁸ |
| 95-th percentile | 3.1407557 × 10⁸ |
| Maximum | 3.6790706 × 10⁸ |
| Range | 3.6790492 × 10⁸ |
| Interquartile range (IQR) | 1.4612934 × 10⁸ |
Descriptive statistics
| Standard deviation | 1.0003163 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 1.0070034 |
| Kurtosis | -0.060172575 |
| Mean | 99335943 |
| Median Absolute Deviation (MAD) | 50238891 |
| Skewness | 1.0218632 |
| Sum | 5.0035515 × 10¹¹ |
| Variance | 1.0006328 × 10¹⁶ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 107434423 | 156 | 3.1% |
| 3965428 | 57 | 1.1% |
| 47172572 | 52 | 1.0% |
| 8534462 | 39 | 0.8% |
| 359234447 | 36 | 0.7% |
| 12243051 | 36 | 0.7% |
| 88566861 | 36 | 0.7% |
| 9094538 | 31 | 0.6% |
| 170785489 | 28 | 0.6% |
| 100782278 | 27 | 0.5% |
| Other values (2914) | 4539 |
| Value | Count | Frequency (%) |
| 2140 | 3 | |
| 2153 | 2 | |
| 2745 | 1 | < 0.1% |
| 4434 | 4 | |
| 5775 | 1 | < 0.1% |
| 6162 | 1 | < 0.1% |
| 9301 | 1 | < 0.1% |
| 11278 | 1 | < 0.1% |
| 13014 | 1 | < 0.1% |
| 17928 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 367907062 | 3 | |
| 366601024 | 1 | < 0.1% |
| 366264178 | 1 | < 0.1% |
| 365996898 | 1 | < 0.1% |
| 365764996 | 1 | < 0.1% |
| 365131679 | 1 | < 0.1% |
| 365089948 | 1 | < 0.1% |
| 363488081 | 1 | < 0.1% |
| 363370322 | 1 | < 0.1% |
| 362355487 | 1 | < 0.1% |
host_name
Text
| Distinct | 1616 |
|---|---|
| Distinct (%) | 32.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.7 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 32 |
| Mean length | 6.5215406 |
| Min length | 1 |
Unique
| Unique | 919 |
|---|---|
| Unique (%) | 18.2% |
Sample
| 1st row | William |
|---|---|
| 2nd row | Jeanne |
| 3rd row | Thanh |
| 4th row | Barbara |
| 5th row | Aj |
| Value | Count | Frequency (%) |
|  | 238 | 3.9% |
| blueground | 156 | 2.6% |
| and | 100 | 1.6% |
| rob | 67 | 1.1% |
| john | 57 | 0.9% |
| michael | 56 | 0.9% |
| alex | 54 | 0.9% |
| zencity | 52 | 0.9% |
| nicole | 45 | 0.7% |
| david | 45 | 0.7% |
| Other values (1594) | 5208 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3597 | 11.0% |
| e | 3000 | 9.1% |
| n | 2649 | 8.1% |
| i | 2389 | 7.3% |
| r | 1871 | 5.7% |
| o | 1705 | 5.2% |
| l | 1655 | 5.0% |
| t | 1206 | 3.7% |
| (space) | 1049 | 3.2% |
| d | 939 | 2.9% |
| Other values (65) | 12789 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 32849 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 32849 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 32849 |
Most frequent character per category, script, and block
Every character falls in the single (unknown) group, so each of these three tables is identical to the Most occurring characters table.
neighbourhood
Text
| Distinct | 77 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.7 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 15 |
| Mean length | 11.035537 |
| Min length | 4 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Gage Park |
|---|---|
| 2nd row | North Center |
| 3rd row | Logan Square |
| 4th row | Rogers Park |
| 5th row | South Shore |
| Value | Count | Frequency (%) |
| west | 1064 | 10.5% |
| side | 1063 | 10.4% |
| park | 973 | 9.6% |
| near | 909 | 8.9% |
| north | 690 | 6.8% |
| town | 568 | 5.6% |
| lake | 422 | 4.1% |
| view | 422 | 4.1% |
| square | 420 | 4.1% |
| lincoln | 328 | 3.2% |
| Other values (71) | 3315 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 6123 | 11.0% |
| (space) | 5137 | 9.2% |
| r | 4586 | 8.3% |
| a | 4181 | 7.5% |
| o | 4167 | 7.5% |
| t | 2873 | 5.2% |
| n | 2714 | 4.9% |
| i | 2442 | 4.4% |
| d | 2087 | 3.8% |
| S | 1797 | 3.2% |
| Other values (36) | 19479 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 55586 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 55586 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 55586 |
Most frequent character per category, script, and block
Every character falls in the single (unknown) group, so each of these three tables is identical to the Most occurring characters table.
latitude
Real number (ℝ)
High correlation 
| Distinct | 4244 |
|---|---|
| Distinct (%) | 84.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.899259 |
| Minimum | 41.64736 |
|---|---|
| Maximum | 42.02251 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 41.64736 |
|---|---|
| 5-th percentile | 41.782918 |
| Q1 | 41.8734 |
| median | 41.90191 |
| Q3 | 41.93976 |
| 95-th percentile | 41.987868 |
| Maximum | 42.02251 |
| Range | 0.37515 |
| Interquartile range (IQR) | 0.06636 |
Descriptive statistics
| Standard deviation | 0.058928609 |
|---|---|
| Coefficient of variation (CV) | 0.0014064356 |
| Kurtosis | 0.81845286 |
| Mean | 41.899259 |
| Median Absolute Deviation (MAD) | 0.03435 |
| Skewness | -0.73230955 |
| Sum | 211046.57 |
| Variance | 0.0034725809 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.88306 | 19 | 0.4% |
| 41.89111 | 15 | 0.3% |
| 41.88608 | 14 | 0.3% |
| 41.89063 | 13 | 0.3% |
| 41.88558 | 11 | 0.2% |
| 41.92819 | 11 | 0.2% |
| 41.89862 | 8 | 0.2% |
| 41.88652 | 7 | 0.1% |
| 41.8989 | 7 | 0.1% |
| 41.89235 | 7 | 0.1% |
| Other values (4234) | 4925 |
| Value | Count | Frequency (%) |
| 41.64736 | 1 | |
| 41.65208 | 1 | |
| 41.65388 | 1 | |
| 41.65578 | 1 | |
| 41.65977 | 1 | |
| 41.68289 | 1 | |
| 41.68612 | 1 | |
| 41.68664 | 1 | |
| 41.6883 | 1 | |
| 41.68906 | 1 |
| Value | Count | Frequency (%) |
| 42.02251 | 1 | |
| 42.02139 | 1 | |
| 42.02119 | 1 | |
| 42.02105 | 1 | |
| 42.02087 | 1 | |
| 42.02077 | 1 | |
| 42.02042 | 1 | |
| 42.01957 | 1 | |
| 42.01947 | 1 | |
| 42.01926 | 1 |
longitude
Real number (ℝ)
High correlation 
| Distinct | 4052 |
|---|---|
| Distinct (%) | 80.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -87.663981 |
| Minimum | -87.84681 |
|---|---|
| Maximum | -87.53752 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 5037 |
| Negative (%) | 100.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | -87.84681 |
|---|---|
| 5-th percentile | -87.736012 |
| Q1 | -87.68671 |
| median | -87.66088 |
| Q3 | -87.63335 |
| 95-th percentile | -87.604238 |
| Maximum | -87.53752 |
| Range | 0.30929 |
| Interquartile range (IQR) | 0.05336 |
Descriptive statistics
| Standard deviation | 0.042619423 |
|---|---|
| Coefficient of variation (CV) | -0.000486168 |
| Kurtosis | 1.2988823 |
| Mean | -87.663981 |
| Median Absolute Deviation (MAD) | 0.027 |
| Skewness | -0.67843934 |
| Sum | -441563.47 |
| Variance | 0.0018164152 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -87.65131 | 17 | 0.3% |
| -87.62205 | 15 | 0.3% |
| -87.63422 | 15 | 0.3% |
| -87.61903 | 14 | 0.3% |
| -87.6525 | 13 | 0.3% |
| -87.6257 | 12 | 0.2% |
| -87.62797 | 8 | 0.2% |
| -87.62472 | 7 | 0.1% |
| -87.63337 | 7 | 0.1% |
| -87.65159 | 7 | 0.1% |
| Other values (4042) | 4922 |
| Value | Count | Frequency (%) |
| -87.84681 | 1 | |
| -87.84527 | 1 | |
| -87.84474 | 1 | |
| -87.84363 | 1 | |
| -87.84196 | 1 | |
| -87.84193 | 1 | |
| -87.84012 | 1 | |
| -87.83983 | 1 | |
| -87.83528 | 1 | |
| -87.83526 | 2 |
| Value | Count | Frequency (%) |
| -87.53752 | 1 | |
| -87.5379 | 1 | |
| -87.54496 | 1 | |
| -87.54557 | 1 | |
| -87.54593 | 1 | |
| -87.54595 | 1 | |
| -87.54596 | 2 | |
| -87.54603 | 1 | |
| -87.54615 | 1 | |
| -87.54775 | 1 |
room_type
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 78.7 KiB |
| Entire home/apt | 3458 |
|---|---|
| Private room | 1448 |
| Shared room | 75 |
| Hotel room | 56 |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 14.022434 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Private room |
|---|---|
| 2nd row | Entire home/apt |
| 3rd row | Entire home/apt |
| 4th row | Entire home/apt |
| 5th row | Private room |
Common Values
| Value | Count | Frequency (%) |
| Entire home/apt | 3458 | 68.7% |
| Private room | 1448 | 28.7% |
| Shared room | 75 | 1.5% |
| Hotel room | 56 | 1.1% |
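The frequency column for a categorical variable follows directly from the counts. As a minimal sketch, assuming each frequency is the category count divided by the total number of rows and rounded to one decimal, the `room_type` shares can be recomputed like this:

```python
# Counts copied from the Common Values table for room_type.
counts = {
    "Entire home/apt": 3458,
    "Private room": 1448,
    "Shared room": 75,
    "Hotel room": 56,
}

total = sum(counts.values())  # should match the 5037 observations

# Percentage share of each category, rounded to one decimal.
freq = {name: round(100.0 * n / total, 1) for name, n in counts.items()}

for name, share in freq.items():
    print(f"{name}: {share}%")
```

The four counts sum to 5037, which is consistent with the variable having no missing values.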
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| entire | 3458 | |
| home/apt | 3458 | |
| room | 1579 | |
| private | 1448 | |
| shared | 75 | 0.7% |
| hotel | 56 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8495 | |
| t | 8420 | |
| o | 6672 | |
| r | 6560 | |
| (space) | 5037 | 7.1% |
| m | 5037 | 7.1% |
| a | 4981 | 7.1% |
| i | 4906 | 6.9% |
| h | 3533 | 5.0% |
| n | 3458 | 4.9% |
| Other values (9) | 13532 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 70631 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 70631 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 70631 |
Most frequent character per category, script, and block
Every character falls in the single (unknown) group, so each of these three tables is identical to the Most occurring characters table.
price
Real number (ℝ)
Skewed 
| Distinct | 454 |
|---|---|
| Distinct (%) | 9.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 150.98015 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 31 |
| Q1 | 65 |
| median | 99 |
| Q3 | 155 |
| 95-th percentile | 399 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 90 |
Descriptive statistics
| Standard deviation | 364.92497 |
|---|---|
| Coefficient of variation (CV) | 2.4170394 |
| Kurtosis | 553.10462 |
| Mean | 150.98015 |
| Median Absolute Deviation (MAD) | 43 |
| Skewness | 21.514007 |
| Sum | 760487 |
| Variance | 133170.23 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 110 | 2.2% |
| 80 | 101 | 2.0% |
| 75 | 99 | 2.0% |
| 150 | 97 | 1.9% |
| 100 | 88 | 1.7% |
| 65 | 87 | 1.7% |
| 70 | 84 | 1.7% |
| 60 | 77 | 1.5% |
| 90 | 76 | 1.5% |
| 125 | 75 | 1.5% |
| Other values (444) | 4143 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 10 | 3 | 0.1% |
| 13 | 1 | < 0.1% |
| 14 | 2 | < 0.1% |
| 15 | 6 | |
| 16 | 2 | < 0.1% |
| 17 | 2 | < 0.1% |
| 18 | 4 | |
| 19 | 4 | |
| 20 | 8 |
| Value | Count | Frequency (%) |
| 9999 | 5 | |
| 7000 | 1 | < 0.1% |
| 3429 | 1 | < 0.1% |
| 3070 | 1 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 2788 | 1 | < 0.1% |
| 1999 | 1 | < 0.1% |
| 1921 | 1 | < 0.1% |
| 1828 | 1 | < 0.1% |
| 1500 | 3 |
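The skew alert for `price` is driven by a small number of very large values. One common way to flag such values, which is not necessarily what ydata-profiling does internally, is Tukey's fences: anything beyond Q3 + 1.5 × IQR or below Q1 − 1.5 × IQR is treated as a candidate outlier. The sketch below applies that rule to the reported quartiles. The function name and the multiplier `k=1.5` are assumptions made for illustration.

```python
def tukey_fences(q1: float, q3: float, k: float = 1.5) -> tuple[float, float]:
    """Return the (lower, upper) Tukey fences for the given quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles copied from the price quantile statistics.
lower, upper = tukey_fences(q1=65, q3=155)
print(lower, upper)  # -70.0 290.0

# The ten largest distinct prices listed in the report.
largest = [9999, 7000, 3429, 3070, 3000, 2788, 1999, 1921, 1828, 1500]
flagged = [p for p in largest if p > upper]
print(len(flagged))  # 10
```

The lower fence is negative here, so the rule does not flag the single listing priced at 0. That value may still deserve a manual check.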
minimum_nights
Real number (ℝ)
| Distinct | 55 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.0883462 |
| Minimum | 1 |
|---|---|
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 31 |
| Maximum | 500 |
| Range | 499 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 22.371359 |
|---|---|
| Coefficient of variation (CV) | 2.7658755 |
| Kurtosis | 160.42948 |
| Mean | 8.0883462 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 10.584917 |
| Sum | 40741 |
| Variance | 500.4777 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1630 | |
| 1 | 1528 | |
| 3 | 685 | |
| 30 | 289 | 5.7% |
| 4 | 135 | 2.7% |
| 31 | 109 | 2.2% |
| 7 | 105 | 2.1% |
| 32 | 87 | 1.7% |
| 5 | 85 | 1.7% |
| 14 | 56 | 1.1% |
| Other values (45) | 328 | 6.5% |
| Value | Count | Frequency (%) |
| 1 | 1528 | |
| 2 | 1630 | |
| 3 | 685 | |
| 4 | 135 | 2.7% |
| 5 | 85 | 1.7% |
| 6 | 25 | 0.5% |
| 7 | 105 | 2.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 10 | 33 | 0.7% |
| Value | Count | Frequency (%) |
| 500 | 1 | < 0.1% |
| 365 | 8 | |
| 240 | 1 | < 0.1% |
| 210 | 1 | < 0.1% |
| 200 | 1 | < 0.1% |
| 185 | 1 | < 0.1% |
| 182 | 1 | < 0.1% |
| 180 | 7 | |
| 179 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
number_of_reviews
Real number (ℝ)
High correlation  Zeros 
| Distinct | 308 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.045265 |
| Minimum | 0 |
|---|---|
| Maximum | 632 |
| Zeros | 886 |
| Zeros (%) | 17.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 15 |
| Q3 | 55 |
| 95-th percentile | 173.2 |
| Maximum | 632 |
| Range | 632 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 64.638322 |
|---|---|
| Coefficient of variation (CV) | 1.5373508 |
| Kurtosis | 11.387585 |
| Mean | 42.045265 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | 2.8142796 |
| Sum | 211782 |
| Variance | 4178.1127 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 886 | 17.6% |
| 1 | 326 | 6.5% |
| 2 | 228 | 4.5% |
| 3 | 186 | 3.7% |
| 5 | 118 | 2.3% |
| 4 | 117 | 2.3% |
| 7 | 102 | 2.0% |
| 6 | 83 | 1.6% |
| 9 | 78 | 1.5% |
| 8 | 75 | 1.5% |
| Other values (298) | 2838 |
| Value | Count | Frequency (%) |
| 0 | 886 | |
| 1 | 326 | 6.5% |
| 2 | 228 | 4.5% |
| 3 | 186 | 3.7% |
| 4 | 117 | 2.3% |
| 5 | 118 | 2.3% |
| 6 | 83 | 1.6% |
| 7 | 102 | 2.0% |
| 8 | 75 | 1.5% |
| 9 | 78 | 1.5% |
| Value | Count | Frequency (%) |
| 632 | 1 | |
| 625 | 1 | |
| 541 | 1 | |
| 511 | 1 | |
| 506 | 1 | |
| 500 | 1 | |
| 499 | 2 | |
| 488 | 1 | |
| 461 | 1 | |
| 442 | 1 |
last_review
Date
Missing 
| Distinct | 680 |
|---|---|
| Distinct (%) | 16.4% |
| Missing | 886 |
| Missing (%) | 17.6% |
| Memory size | 78.7 KiB |
| Minimum | 2013-08-18 00:00:00 |
|---|---|
| Maximum | 2020-09-21 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
reviews_per_month
Real number (ℝ)
High correlation  Missing 
| Distinct | 627 |
|---|---|
| Distinct (%) | 15.1% |
| Missing | 886 |
| Missing (%) | 17.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7346567 |
| Minimum | 0.02 |
|---|---|
| Maximum | 32.43 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 0.02 |
|---|---|
| 5-th percentile | 0.09 |
| Q1 | 0.43 |
| median | 1.24 |
| Q3 | 2.56 |
| 95-th percentile | 4.9 |
| Maximum | 32.43 |
| Range | 32.41 |
| Interquartile range (IQR) | 2.13 |
Descriptive statistics
| Standard deviation | 1.7247189 |
|---|---|
| Coefficient of variation (CV) | 0.994271 |
| Kurtosis | 27.811089 |
| Mean | 1.7346567 |
| Median Absolute Deviation (MAD) | 0.94 |
| Skewness | 2.8437583 |
| Sum | 7200.56 |
| Variance | 2.9746552 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 59 | 1.2% |
| 0.17 | 49 | 1.0% |
| 0.14 | 43 | 0.9% |
| 0.08 | 41 | 0.8% |
| 0.16 | 36 | 0.7% |
| 0.03 | 33 | 0.7% |
| 0.09 | 33 | 0.7% |
| 0.12 | 32 | 0.6% |
| 0.22 | 32 | 0.6% |
| 0.19 | 29 | 0.6% |
| Other values (617) | 3764 | |
| (Missing) | 886 | 17.6% |
| Value | Count | Frequency (%) |
| 0.02 | 12 | 0.2% |
| 0.03 | 33 | |
| 0.04 | 27 | |
| 0.05 | 20 | |
| 0.06 | 23 | |
| 0.07 | 22 | |
| 0.08 | 41 | |
| 0.09 | 33 | |
| 0.1 | 23 | |
| 0.11 | 28 |
| Value | Count | Frequency (%) |
| 32.43 | 1 | |
| 16.93 | 1 | |
| 11.69 | 1 | |
| 11.54 | 1 | |
| 11.15 | 1 | |
| 11.07 | 1 | |
| 10.97 | 1 | |
| 10.86 | 1 | |
| 10.51 | 1 | |
| 9.52 | 1 |
calculated_host_listings_count
Real number (ℝ)
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.482629 |
| Minimum | 1 |
|---|---|
| Maximum | 205 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 7 |
| 95-th percentile | 62 |
| Maximum | 205 |
| Range | 204 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 36.67142 |
|---|---|
| Coefficient of variation (CV) | 2.7199014 |
| Kurtosis | 20.207855 |
| Mean | 13.482629 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.4866916 |
| Sum | 67912 |
| Variance | 1344.793 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2116 | |
| 2 | 676 | 13.4% |
| 3 | 357 | 7.1% |
| 4 | 297 | 5.9% |
| 205 | 156 | 3.1% |
| 5 | 149 | 3.0% |
| 6 | 102 | 2.0% |
| 7 | 101 | 2.0% |
| 9 | 94 | 1.9% |
| 8 | 78 | 1.5% |
| Other values (23) | 911 |
| Value | Count | Frequency (%) |
| 1 | 2116 | |
| 2 | 676 | 13.4% |
| 3 | 357 | 7.1% |
| 4 | 297 | 5.9% |
| 5 | 149 | 3.0% |
| 6 | 102 | 2.0% |
| 7 | 101 | 2.0% |
| 8 | 78 | 1.5% |
| 9 | 94 | 1.9% |
| 10 | 58 | 1.2% |
| Value | Count | Frequency (%) |
| 205 | 156 | |
| 73 | 57 | 1.1% |
| 62 | 52 | 1.0% |
| 47 | 75 | |
| 45 | 36 | 0.7% |
| 44 | 36 | 0.7% |
| 37 | 31 | 0.6% |
| 31 | 52 | 1.0% |
| 30 | 24 | 0.5% |
| 28 | 27 | 0.5% |
availability_365
Real number (ℝ)
Zeros 
| Distinct | 361 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 173.55906 |
| Minimum | 0 |
|---|---|
| Maximum | 365 |
| Zeros | 948 |
| Zeros (%) | 18.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 78.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 34 |
| median | 159 |
| Q3 | 329 |
| 95-th percentile | 365 |
| Maximum | 365 |
| Range | 365 |
| Interquartile range (IQR) | 295 |
Descriptive statistics
| Standard deviation | 138.89201 |
|---|---|
| Coefficient of variation (CV) | 0.8002579 |
| Kurtosis | -1.5528595 |
| Mean | 173.55906 |
| Median Absolute Deviation (MAD) | 152 |
| Skewness | 0.12234251 |
| Sum | 874217 |
| Variance | 19290.991 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 948 | 18.8% |
| 365 | 293 | 5.8% |
| 364 | 127 | 2.5% |
| 90 | 87 | 1.7% |
| 89 | 75 | 1.5% |
| 263 | 71 | 1.4% |
| 179 | 63 | 1.3% |
| 355 | 57 | 1.1% |
| 180 | 57 | 1.1% |
| 363 | 52 | 1.0% |
| Other values (351) | 3207 |
| Value | Count | Frequency (%) |
| 0 | 948 | |
| 1 | 43 | 0.9% |
| 2 | 10 | 0.2% |
| 3 | 22 | 0.4% |
| 4 | 15 | 0.3% |
| 5 | 8 | 0.2% |
| 6 | 10 | 0.2% |
| 7 | 12 | 0.2% |
| 8 | 4 | 0.1% |
| 9 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 365 | 293 | |
| 364 | 127 | |
| 363 | 52 | 1.0% |
| 362 | 50 | 1.0% |
| 361 | 29 | 0.6% |
| 360 | 49 | 1.0% |
| 359 | 36 | 0.7% |
| 358 | 29 | 0.6% |
| 357 | 21 | 0.4% |
| 356 | 22 | 0.4% |
Interactions
Correlations
| | availability_365 | calculated_host_listings_count | host_id | id | latitude | longitude | minimum_nights | number_of_reviews | price | reviews_per_month | room_type |
|---|---|---|---|---|---|---|---|---|---|---|---|
| availability_365 | 1.000 | 0.277 | -0.016 | 0.004 | -0.070 | 0.078 | 0.179 | -0.029 | 0.115 | 0.018 | 0.083 |
| calculated_host_listings_count | 0.277 | 1.000 | 0.033 | 0.227 | -0.185 | 0.295 | 0.152 | -0.212 | 0.038 | -0.037 | 0.145 |
| host_id | -0.016 | 0.033 | 1.000 | 0.471 | -0.126 | 0.054 | -0.098 | -0.244 | -0.009 | 0.082 | 0.107 |
| id | 0.004 | 0.227 | 0.471 | 1.000 | -0.146 | 0.120 | 0.052 | -0.640 | 0.005 | -0.005 | 0.101 |
| latitude | -0.070 | -0.185 | -0.126 | -0.146 | 1.000 | -0.522 | -0.089 | 0.129 | 0.082 | 0.032 | 0.151 |
| longitude | 0.078 | 0.295 | 0.054 | 0.120 | -0.522 | 1.000 | 0.137 | -0.188 | 0.188 | -0.088 | 0.105 |
| minimum_nights | 0.179 | 0.152 | -0.098 | 0.052 | -0.089 | 0.137 | 1.000 | -0.274 | 0.135 | -0.260 | 0.054 |
| number_of_reviews | -0.029 | -0.212 | -0.244 | -0.640 | 0.129 | -0.188 | -0.274 | 1.000 | -0.109 | 0.791 | 0.000 |
| price | 0.115 | 0.038 | -0.009 | 0.005 | 0.082 | 0.188 | 0.135 | -0.109 | 1.000 | -0.089 | 0.103 |
| reviews_per_month | 0.018 | -0.037 | 0.082 | -0.005 | 0.032 | -0.088 | -0.260 | 0.791 | -0.089 | 1.000 | 0.101 |
| room_type | 0.083 | 0.145 | 0.107 | 0.101 | 0.151 | 0.105 | 0.054 | 0.000 | 0.103 | 0.101 | 1.000 |
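The high-correlation alerts can be related back to this matrix by listing every variable pair whose absolute coefficient reaches a cutoff. The sketch below uses a cutoff of 0.5, which is an assumption chosen because it reproduces the pairs named in the alerts. The report itself does not state the threshold, and `high_pairs` is a hypothetical helper.

```python
names = [
    "availability_365", "calculated_host_listings_count", "host_id", "id",
    "latitude", "longitude", "minimum_nights", "number_of_reviews",
    "price", "reviews_per_month", "room_type",
]

# Coefficients copied row by row from the correlation table.
matrix = [
    [1.000, 0.277, -0.016, 0.004, -0.070, 0.078, 0.179, -0.029, 0.115, 0.018, 0.083],
    [0.277, 1.000, 0.033, 0.227, -0.185, 0.295, 0.152, -0.212, 0.038, -0.037, 0.145],
    [-0.016, 0.033, 1.000, 0.471, -0.126, 0.054, -0.098, -0.244, -0.009, 0.082, 0.107],
    [0.004, 0.227, 0.471, 1.000, -0.146, 0.120, 0.052, -0.640, 0.005, -0.005, 0.101],
    [-0.070, -0.185, -0.126, -0.146, 1.000, -0.522, -0.089, 0.129, 0.082, 0.032, 0.151],
    [0.078, 0.295, 0.054, 0.120, -0.522, 1.000, 0.137, -0.188, 0.188, -0.088, 0.105],
    [0.179, 0.152, -0.098, 0.052, -0.089, 0.137, 1.000, -0.274, 0.135, -0.260, 0.054],
    [-0.029, -0.212, -0.244, -0.640, 0.129, -0.188, -0.274, 1.000, -0.109, 0.791, 0.000],
    [0.115, 0.038, -0.009, 0.005, 0.082, 0.188, 0.135, -0.109, 1.000, -0.089, 0.103],
    [0.018, -0.037, 0.082, -0.005, 0.032, -0.088, -0.260, 0.791, -0.089, 1.000, 0.101],
    [0.083, 0.145, 0.107, 0.101, 0.151, 0.105, 0.054, 0.000, 0.103, 0.101, 1.000],
]


def high_pairs(names, matrix, threshold=0.5):
    """List (a, b, r) for each unordered pair with |r| >= threshold."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):  # upper triangle only
            r = matrix[i][j]
            if abs(r) >= threshold:
                pairs.append((names[i], names[j], r))
    return pairs


pairs = high_pairs(names, matrix)
for a, b, r in pairs:
    print(f"{a} ~ {b}: {r:+.3f}")
```

With this cutoff the result contains three pairs: `id` with `number_of_reviews`, `latitude` with `longitude`, and `number_of_reviews` with `reviews_per_month`.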
Missing values
Sample
| | id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1930 | 21174388 | Best deal in the area | 43392136 | William | Gage Park | 41.79359 | -87.70345 | Private room | 46 | 1 | 162 | 2020-09-11 | 4.50 | 3 | 364 |
| 5588 | 42670833 | whole apartment kitchen **1 bed/bath **living room | 110299543 | Jeanne | North Center | 41.95803 | -87.69214 | Entire home/apt | 120 | 2 | 0 | NaN | NaN | 1 | 81 |
| 599 | 7564825 | Logan Square 3 bedrms with parking | 6427776 | Thanh | Logan Square | 41.92654 | -87.72505 | Entire home/apt | 146 | 2 | 100 | 2019-12-01 | 1.60 | 1 | 180 |
| 778 | 9660334 | Vintage Charm Near Lake | 11520899 | Barbara | Rogers Park | 42.00451 | -87.66203 | Entire home/apt | 135 | 7 | 67 | 2020-08-03 | 1.17 | 1 | 73 |
| 1509 | 18059552 | Study Oasis | 124307753 | Aj | South Shore | 41.76867 | -87.56926 | Private room | 22 | 1 | 41 | 2020-09-06 | 1.06 | 2 | 319 |
| 3660 | 33810838 | 2F-2BR Apt in Bridgeport along Archer Ave.\n聚华坊 | 150847041 | Henry | Bridgeport | 41.84030 | -87.65903 | Entire home/apt | 118 | 2 | 20 | 2020-09-07 | 1.20 | 2 | 24 |
| 766 | 9505429 | Roscoe Village Inn|Walk to Chicago's Wrigley Field | 49260140 | Kimberly | North Center | 41.94629 | -87.68011 | Entire home/apt | 179 | 3 | 2 | 2016-06-12 | 0.04 | 3 | 1 |
| 4018 | 35926860 | Lux River North Studio w/ Gym, W/D, nr. Magnificent Mile, by Blueground | 107434423 | Blueground | Near North Side | 41.89502 | -87.62791 | Entire home/apt | 130 | 30 | 0 | NaN | NaN | 205 | 344 |
| 2416 | 24628815 | Chicago's Balloon | 62447270 | Fernanda | Bridgeport | 41.83363 | -87.65166 | Private room | 44 | 3 | 23 | 2020-08-03 | 0.79 | 1 | 89 |
| 4429 | 38192546 | Dog friendly, entire apt in trendy Logan Square. | 58057341 | Brett | Logan Square | 41.92595 | -87.69130 | Entire home/apt | 110 | 3 | 7 | 2019-10-28 | 0.55 | 1 | 0 |
| | id | name | host_id | host_name | neighbourhood | latitude | longitude | room_type | price | minimum_nights | number_of_reviews | last_review | reviews_per_month | calculated_host_listings_count | availability_365 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2709 | 26776933 | Room D (Queen size bed) | 40832295 | Shannon & Carmen | Belmont Cragin | 41.92027 | -87.77840 | Private room | 115 | 2 | 6 | 2019-08-04 | 0.23 | 4 | 365 |
| 3418 | 32051074 | Character 2 Bedroom & Balcony-Heart of Chic Center | 62066342 | Rafaello | Near North Side | 41.89163 | -87.63800 | Entire home/apt | 109 | 120 | 0 | NaN | NaN | 5 | 365 |
| 1082 | 13798441 | Beautiful 1-bedroom, 1-bathroom Bucktown Apartment | 81125437 | Julie | Logan Square | 41.92309 | -87.68304 | Entire home/apt | 70 | 2 | 111 | 2020-09-13 | 2.17 | 1 | 81 |
| 4185 | 36910970 | Splendid Skyscraper in the Heart of Chicago | 260807399 | John | Near West Side | 41.88340 | -87.64307 | Entire home/apt | 400 | 2 | 3 | 2019-11-10 | 0.23 | 6 | 0 |
| 1356 | 16054326 | Brand New Logan Square Private Floor with Laundry! | 20852412 | Tawni | Avondale | 41.93312 | -87.70809 | Private room | 139 | 1 | 35 | 2020-02-17 | 0.87 | 1 | 109 |
| 187 | 2320027 | Carriage House in Wicker Park | 11848299 | Anton | West Town | 41.90669 | -87.68283 | Entire home/apt | 114 | 2 | 158 | 2019-09-16 | 1.97 | 1 | 0 |
| 3228 | 30171391 | Classic Chicago (entire ) 3 bed Row HousePARK FREE | 25715675 | Joyce | East Garfield Park | 41.87980 | -87.70136 | Entire home/apt | 79 | 4 | 89 | 2020-08-28 | 4.18 | 3 | 19 |
| 2192 | 22770779 | 3 Level Coach House in Roscoe Village with Patio! | 30320286 | Aqueel | North Center | 41.94383 | -87.68014 | Entire home/apt | 128 | 4 | 81 | 2020-09-05 | 2.81 | 1 | 344 |
| 638 | 7937969 | CHICAGO WRIGLEYVILLE CUBS HEADQUARTERS | 5547803 | Anne | Lake View | 41.95045 | -87.65558 | Entire home/apt | 125 | 1 | 3 | 2019-05-12 | 0.06 | 1 | 0 |
| 5690 | 42951790 | ❤️ Low Prices! Stylish 2BR Condo in Logan Square! | 341917367 | Joan | Logan Square | 41.92068 | -87.71599 | Entire home/apt | 106 | 4 | 2 | 2020-09-19 | 2.00 | 8 | 85 |